Speaker-Specific Biomechanical Model-Based Investigation of a Simple Speech Task Based on Tagged-MRI

نویسندگان

Keyi Tang

Negar M. Harandi

Jonghye Woo

Georges El Fakhri

Maureen Stone

Sidney Fels

چکیده

We create two 3D biomechanical speaker models matched to medical image data of two healthy English speakers. We use a new, hybrid registration technique that morphs a generic 3D, biomechanical model to medical images. The generic model of the head and neck includes jaw, tongue, soft-palate, epiglottis, lips and face, and is capable of simulating upper-airway biomechanics. We use cine and tagged magnetic resonance (MR) images captured while our volunteers repeated a simple utterance (/@-gis/) synchronized to a metronome. We simulate our models based on internal tongue tissue trajectories that we extract from tagged MR images, and use in an inverse solver. For areas without tracked data points, the registered generic model moves based on the computed muscle activations. Our modeling efforts include a wide range of speech organs illustrating the coupling complexity between the oral anatomy during simple speech utterances.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

شبکه عصبی پیچشی با پنجره‌های قابل تطبیق برای بازشناسی گفتار

Although, speech recognition systems are widely used and their accuracies are continuously increased, there is a considerable performance gap between their accuracies and human recognition ability. This is partially due to high speaker variations in speech signal. Deep neural networks are among the best tools for acoustic modeling. Recently, using hybrid deep neural network and hidden Markov mo...

متن کامل

Speaker Adaptation in Continuous Speech Recognition Using MLLR-Based MAP Estimation

A variety of methods are used for speaker adaptation in speech recognition. In some techniques, such as MAP estimation, only the models with available training data are updated. Hence, large amounts of training data are required in order to have significant recognition improvements. In some others, such as MLLR, where several general transformations are applied to model clusters, the results ar...

متن کامل

Speaker Adaptation in Continuous Speech Recognition Using MLLR-Based MAP Estimation

متن کامل

Recognizing the Emotional State Changes in Human Utterance by a Learning Statistical Method based on Gaussian Mixture Model

Speech is one of the most opulent and instant methods to express emotional characteristics of human beings, which conveys the cognitive and semantic concepts among humans. In this study, a statistical-based method for emotional recognition of speech signals is proposed, and a learning approach is introduced, which is based on the statistical model to classify internal feelings of the utterance....

متن کامل

A New Method for Speech Enhancement Based on Incoherent Model Learning in Wavelet Transform Domain

Quality of speech signal significantly reduces in the presence of environmental noise signals and leads to the imperfect performance of hearing aid devices, automatic speech recognition systems, and mobile phones. In this paper, the single channel speech enhancement of the corrupted signals by the additive noise signals is considered. A dictionary-based algorithm is proposed to train the speech...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2017

Speaker-Specific Biomechanical Model-Based Investigation of a Simple Speech Task Based on Tagged-MRI

نویسندگان

چکیده

منابع مشابه

شبکه عصبی پیچشی با پنجره‌های قابل تطبیق برای بازشناسی گفتار

Speaker Adaptation in Continuous Speech Recognition Using MLLR-Based MAP Estimation

Speaker Adaptation in Continuous Speech Recognition Using MLLR-Based MAP Estimation

Recognizing the Emotional State Changes in Human Utterance by a Learning Statistical Method based on Gaussian Mixture Model

A New Method for Speech Enhancement Based on Incoherent Model Learning in Wavelet Transform Domain

عنوان ژورنال:

اشتراک گذاری